A Practical No-Linear-Regret Algorithm for Convex Games
نویسنده
چکیده
For convex games, connections between playing by no-regret algorithms and playing equilibrium strategies have previously been made for Φregret, a generalization of external regret [5]. In particular, Gordon et al. present a no-Φ-regret algorithm for several different classes of transformations Φ [4]. In this paper, we instantiate the algorithm for the class of linear transformations using a variety of optimization techniques and give experimental results on several games including Indian poker, a simple but substantially large-scale variant of poker. Our results show that both no-external-regret and no-linear-regret algorithms can achieve better regret performances than what the current theory guarantees. To the best of our knowledge, this is the first work empirically demonstrating the benefits of a no-Φ-regret algorithm for general convex games where Φ is stronger than external.
منابع مشابه
Verified Learning Without Regret From Algorithmic Game Theory to Distributed Systems With Mechanized Complexity Guarantees
Multiplicative Weights (MW) is a simple yet powerful algorithm for learning linear classifiers, for ensemble learning à la boosting, for approximately solving linear and semidefinite systems, for computing approximate solutions to multicommodity flow problems, and for online convex optimization, among other applications. Recent work in algorithmic game theory, which applies a computational pers...
متن کاملNo-Regret Learnability for Piecewise Linear Losses
In the convex optimization approach to online regret minimization, many methodshave been developed to guarantee a O(√T ) bound on regret for subdifferentiableconvex loss functions with bounded subgradients, by using a reduction to linearloss functions. This suggests that linear loss functions tend to be the hardest onesto learn against, regardless of the underlying d...
متن کاملThe convex optimization approach to regret minimization
A well studied and general setting for prediction and decision making is regret minimization in games. Recently the design of algorithms in this setting has been influenced by tools from convex optimization. In this chapter we describe the recent framework of online convex optimization which naturally merges optimization and regret minimization. We describe the basic algorithms and tools at the...
متن کاملDeep Online Convex Optimization with Gated Games
Methods from convex optimization are widely used as building blocks for deep learning algorithms. However, the reasons for their empirical success are unclear, since modern convolutional networks (convnets), incorporating rectifier units and max-pooling, are neither smooth nor convex. Standard guarantees therefore do not apply. This paper provides the first convergence rates for gradient descen...
متن کاملAn Online Convex Optimization Approach to Blackwell's Approachability
The notion of approachability in repeated games with vector payoffs was introduced by Blackwell in the 1950s, along with geometric conditions for approachability and corresponding strategies that rely on computing steering directions as projections from the current average payoff vector to the (convex) target set. Recently, Abernethy, Batlett and Hazan (2011) proposed a class of approachability...
متن کامل